# Large Model Fine-tuning
Shisa V2 Llama3.3 70b
Shisa V2 is a bilingual (Japanese/English) general-purpose chat model series trained by Shisa.AI, optimized based on Llama-3.3-70B-Instruct, focusing on improving Japanese task performance while maintaining English capabilities.
Large Language Model
Transformers Supports Multiple Languages

S
shisa-ai
144
2
Whisper Large V3 Persian Common Voice 17
Apache-2.0
A Persian automatic speech recognition model fine-tuned based on Whisper Large v3, trained on the Common Voice 17 dataset, which contains over 250,000 Persian audio samples, significantly improving recognition accuracy and robustness.
Speech Recognition
Transformers

W
MohammadGholizadeh
978
3
Modernbert Large Msmarco Bpr
This is a sentence-transformers model fine-tuned from ModernBERT-large, designed to map sentences and paragraphs into a 1024-dimensional dense vector space, supporting tasks such as semantic textual similarity and semantic search.
Text Embedding
M
BlackBeenie
21
2
Japanese Wav2vec2 Large Rs35kh
Apache-2.0
A Japanese automatic speech recognition model fine-tuned on the large-scale Japanese ASR corpus ReazonSpeech v2.0, based on the wav2vec 2.0 Large architecture
Speech Recognition
Transformers Japanese

J
reazon-research
244
1
Bmretriever 7B
MIT
BMRetriever is a 7-billion-parameter large language model specifically optimized for biomedical text retrieval tasks, capable of efficiently handling literature retrieval needs in the medical and biological fields.
Large Language Model
Transformers English

B
BMRetriever
81
5
Phind CodeLlama 34B Python V1
A large language model fine-tuned based on CodeLlama-34B-Python, achieving 69.5% pass@1 on HumanEval, surpassing GPT-4's performance
Large Language Model
Transformers

P
Phind
878
253
T5 11b Trueteacher And Anli
TrueTeacher is a factual consistency evaluation model based on the T5-11B architecture, specifically designed to assess factual consistency in summaries.
Large Language Model
Transformers English

T
google
444
16
Wav2vec2 Large Xls R 300m Pt Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the common_voice_9_0 dataset based on facebook/wav2vec2-xls-r-300m, supporting Portuguese speech-to-text tasks.
Speech Recognition
Transformers

W
robertodtg
107
0
Wav2vec2 Large Xlsr 53 Enlgish FT ASCEND Colab
Apache-2.0
This model is a fine-tuned speech recognition model based on jonatasgrosman/wav2vec2-large-xlsr-53-english on the ascend dataset.
Speech Recognition
Transformers

W
Ryna
16
0
Longt5 Tglobal Large 16384 Pubmed 3k Steps
Apache-2.0
LongT5 is a long-sequence text-to-text Transformer model based on T5, employing transient-global attention mechanism, suitable for long-text processing tasks.
Text Generation English
L
Stancld
1,264
22
Wav2vec2 Large Xlsr Persian V3
Apache-2.0
This is a Persian speech recognition model fine-tuned on the Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.
Speech Recognition
Transformers

W
masoumehb
21
0
Xlsr Wav2vec2 1
Apache-2.0
A speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, supporting multilingual speech-to-text tasks
Speech Recognition
Transformers

X
chrisvinsen
20
0
V1 Speech Processing Project Wav2vec2
Apache-2.0
This model is a fine-tuned speech processing model based on wav2vec2-large-xls-r-300m-Urdu, primarily used for Urdu speech recognition tasks.
Speech Recognition
Transformers

V
Raffay
23
0
20220415 210530
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-2b on the common_voice dataset
Speech Recognition
Transformers

2
lilitket
20
0
20220412 203254
Apache-2.0
This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the common_voice dataset, supporting automatic speech recognition tasks.
Speech Recognition
Transformers

2
lilitket
18
0
Wav2vec2 Large Xls R 300m Russian Colab Beam Search Test
Apache-2.0
This model is a Russian speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-xls-r-300m, achieving a word error rate of 0.468 on the evaluation set.
Speech Recognition
Transformers

W
jfealko
18
0
Wav2vec2 Xls R Tf Left Right Shuru
Apache-2.0
A speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, achieving a word error rate (WER) of 1.2628 on the evaluation set.
Speech Recognition
Transformers

W
hrdipto
29
0
Wav2vec2 Large Xls R 300m Marathi
Apache-2.0
This is a speech recognition model fine-tuned on Marathi datasets based on the facebook/wav2vec2-xls-r-300m model.
Speech Recognition
Transformers Other

W
ravirajoshi
26
1
Biom ALBERT Xxlarge PMC
Large-scale biomedical language models based on BERT, ALBERT, and ELECTRA, achieving state-of-the-art results in multiple biomedical tasks
Large Language Model
Transformers

B
sultan
189
4
Wav2vec2 Large Xls R 300m Ar
Apache-2.0
A speech recognition model fine-tuned on the Common Voice Arabic dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

W
ayameRushia
18
0
Wav2vec2 Xl 960h Dementiabank
Apache-2.0
This model is a speech recognition model fine-tuned on the DementiaBank dataset based on facebook/wav2vec2-large-960h, primarily used for speech-to-text tasks.
Speech Recognition
Transformers

W
shields
20
0
Wav2vec2 Xls R 300m Hy AM CV8 V1
Apache-2.0
A speech recognition model fine-tuned on a general speech dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers

W
emre
17
0
Wav2vec2 Xls R 300m Japanese
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Japanese Common Voice 8.0 dataset based on facebook/wav2vec2-xls-r-300m, supporting Japanese speech-to-text functionality.
Speech Recognition
Transformers Japanese

W
AndrewMcDowell
24
0
Xls R 1b Ur
Apache-2.0
An Urdu automatic speech recognition (ASR) model fine-tuned from Facebook's wav2vec2-xls-r-1b model, trained on the Common Voice 8.0 Urdu dataset
Speech Recognition
Transformers Other

X
HarrisDePerceptron
21
0
Wav2vec2 Xls R 1b Npsc Bokmaal
Apache-2.0
An automatic speech recognition model fine-tuned on the Norwegian written language (Bokmål) speech dataset based on the facebook/wav2vec2-xls-r-1b model
Speech Recognition
Transformers

W
NbAiLab
23
0
Featured Recommended AI Models